首页> 外文OA文献 >Boosting Trees for Anti-Spam Email Filtering
【2h】

Boosting Trees for Anti-Spam Email Filtering

机译:为反垃圾邮件过滤提升树木

摘要

This paper describes a set of comparative experiments for the problem ofautomatically filtering unwanted electronic mail messages. Several variants ofthe AdaBoost algorithm with confidence-rated predictions [Schapire & Singer,99] have been applied, which differ in the complexity of the base learnersconsidered. Two main conclusions can be drawn from our experiments: a) Theboosting-based methods clearly outperform the baseline learning algorithms(Naive Bayes and Induction of Decision Trees) on the PU1 corpus, achieving veryhigh levels of the F1 measure; b) Increasing the complexity of the baselearners allows to obtain better ``high-precision'' classifiers, which is avery important issue when misclassification costs are considered.
机译:本文介绍了一组针对自动过滤不需要的电子邮件消息的比较实验。 AdaBoost算法的几种变体具有置信度预测[Schapire&Singer,99],已被应用,它们在基础学习者的复杂性方面有所不同。从我们的实验中可以得出两个主要结论:a)基于增强的方法明显优于PU1语料库上的基线学习算法(朴素贝叶斯和决策树归纳),实现了很高水平的F1度量; b)增加基础学习器的复杂度允许获得更好的``高精度''分类器,这在考虑分类错误成本时通常是重要的问题。

著录项

  • 作者单位
  • 年度 2001
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号